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The present invention relates generally to the field of structured 
data documents and more particularly to a method of performing a 
search of a numerical document object model (DOM). 



Background of the In ve ntion 

Structured data documents such as HTML (Hyper Text Markup 
Language), XML (extensible Markup Language) and SGML (Standard 
^ 10 Generalized Markup Language) documents and derivatives use tags to 
=3 describe the data associated with the tags. This has an advantage over 

:n databases in that not all the fields are required to be predefined. XML is 

M* presently finding widespread interest for exchanging information 

u between businesses. XML appears to provide an excellent solution for 

^ 15 internet business to business applications. Unfortunately, XML 

documents require a lot of memory and therefore are time consuming 
W and difficult to search. 

M» Thus there exists a need for a method of performing a search of a 

numerical document object model that reduces the memory and speeds 
20 up the time to search structured data documents. 
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Brief Description of the Drawings 



FIG. 1 is an example of an XML document in accordance with one 
embodiment of the invention; 
5 FIG. 2 is an example of a flattened data document in accordance with 

one embodiment of the invention; 

FIG. 3 is a block diagram of a system for storing a flattened data 
document in accordance with one embodiment of the invention; 

FIG. 4 shows two examples of a map store cell in accordance with one 
10 embodiment of the invention; 
Q FIG. 5 is a flow chart of a method of storing a structured data document 

Cj in accordance with one embodiment of the invention; 

FIG. 6 is a flow chart of a method of storing a structured data document 
:^ in accordance with one embodiment of the invention; 
15 ^ FIG. 7 is a flow chart of a method of storing a structured data document 

P in accordance with one embodiment of the invention; 

I'll FIG. 8 is a block diagram of a system for storing a flattened structured 

p data document in accordance with one embodiment of the invention; 

FIG. 9 is a block diagram of a system for storing a flattened structured 
20 data document in accordance with one embodiment of the invention; 

FIG. 10 is a flow chart of the steps used in a method of storing a 
flattened structured data document in accordance with one embodiment of 
the invention; 

FIG. 11 is a flow chart of the steps used in a method of storing a 
25 flattened structured data document in accordance with one embodiment of 
the invention; 

FIG. 12 is a schematic diagram of a method of storing a numerical 
document object model in accordance with one embodiment of the invention; 



FIG. 13 shows several examples of search queries of a numerical 
document object model in accordance with one embodiment of the invention; 

FIG. 14 is a flow chart of the steps used in a method of performing a 
search of a numerical document object model in accordance with one 
embodiment of the invention; 

FIG. 15 is a flow chart of the steps used in a method of performing a 
search of a numerical document object model in accordance with one 
embodiment of the invention; 

FIG. 16 is a flow chart of the steps used in a method of translating a 
structured data document in accordance with one embodiment of the 
invention; and 

FIG. 17 is a flow chart of the steps used in a method of creating an alias 
in a numerical document object model in accordance with one embodiment 
of the invention. 
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Detailed Description of the Dr aw ings 



A method of performing a search of a numerical DOM (document 
object model), includes the steps of receiving a query. When the query is 
5 a fully qualified query, the target string is transformed to form a fully 
qualified hashing code. The target string of a fully qualified query is a 
complete tag string or a complete data string. An associative lookup is 
performed in a map index using the fully qualified hashing code. A map 
offset is returned. The map offset points to a couplet that can be 
10 converted into a standard portion of a structured data document. Finally, 
□ a data couplet is returned. Since the structured data document has been 
17] converted into a numerical DOM, it requires significantly less memory to 
S| store. This significantly reduces the time to search the document. In 
p. addition, once the document has been converted into a numerical DOM, it 
is ^ is no longer necessary to look at the underlying document to perform a 
P search. 

m FIG. 1 is an example of an XML document 10 in accordance with one 

□ embodiment of the invention. The words between the <> are tags that 
describe the data. This document is a catalog 12. Note that all tags are 

20 opened and later closed. For instance <catalog> 12 is closed at the end of the 
document </catalog> 14. The first data item is "Empire Burlesque" 16. The 
tags <CD> 18 and <TITLE> 20 tell us that this is the title of the CD (Compact 
Disk). The next data entry is "Bob Dylan" 22, who is the artist. Other 
compact disks are described in the document. 

25 FIG. 2 is an example of a flattened data document (numerical document 

object model) 40 in accordance with one embodiment of the invention. The 
first five lines 42 are used to store parameters about the document. The next 
line (couplet) 44 shows a line that has flattened all the tags relating to the 



-6- 

first data entry 16 of the XML document 10, Note that the tag <ND> 46 is 
added before every line but is not required by the invention. The next tag is 
CATALOG> 47 which is the same as in the XML document 10. Then the tag CD> 
48 is shown and finally the tag TITLE> 50. Note this is the same order as the 

5 tags in the XML document 10. A plurality of formatting characters 52 are 

shown to the right of each line. The first column is the n-tag level 54. The n- 
tag defines the number of tags that closed in that line. Note that first line 44, 
which ends with the data entry "Empire Burlesque" 16, has a tag 24 (FIG. 1) 
that closes the tag TITLE. The next tag 26 opens the tag ARTIST. As a result 

10 the n-tag for line 44 is a one. Note that line 60 has an n-tag of two. This line 
q corresponds to the data entry 1985 and both the YEAR and the CD tags are 
H closed. 

^1 The next column 56 has a format character that defines whether the 

f* line is first (F) or another line follows it (N-next) or the line is the last (L). 
15 M The next column contains a line type definition 58. Some of the line types 
Q are: time stamp (S); normal (E); identification (I); attribute (A); and 
fy processing (P). The next column 62 is a delete level and is enclosed in a 
q parenthesis. When a delete command is received the data is not actually 
f " erased but is eliminated by entering a number in the parameters in a line to 
20 be erased. So for instance if a delete command is received for "Empire 

Burlesque" 16, a "1" would be entered into the parenthesis of line 44. If a 
delete command was received for "Empire Burlesque" 16 and <TITLE>, </TITLE>, 
a "2" would be entered into the parenthesis. The next column is the parent 
line 64 of the current line. Thus the parent line for the line 66 is the first line 
25 containing the tag CATALOG. If you count the lines you will see that this is 
line five (5) or the preceding line. The last column of formatting characters 
is a p-level 68. The p-level 68 is the first new tag opened but not closed. 
Thus at line 44, which corresponds to the data entry "Empire Burlesque" 16, 
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the first new tag opened is CATALOG. In addition the tag CATALOG is not 
closed. Thus the p-level is two (2). 

FIG. 3 is a block diagram of a system 100 for storing a flattened data 
document in accordance with one embodiment of the invention. Once the 
5 structured data document is flattened as shown in FIG. 2, it can be stored. 
Each unique tag or unique set of tags for each line is stored to a tag and data 
store 102. The first entry in the tag and data store is ND>CATALOG>CD>TITLE> 
104. Next the data entry "Empire Burlesque" 106 is stored in the tag and data 
store 102. The pointers to the tag and data entry in the tag and data store 
10 102 are substituted into line 44. Updated line 44 is then stored in a first cell 
□ 108 of the map store 110. In one embodiment the tag store and the data 
y store are separate. The tag and data store 102 acts as a dictionary, which 
^ reduces the required memory size to store the structured data document. 
^ Note that the formatting characters allow the structured data document to be 
is ^ completely reconstructed. 

Q FIG. 4 shows two examples of a map store cell in accordance with one 

fu embodiment of the invention. The first example 120 works as described 
JJ above. The cell (couplet) 120 has a first pointer (P x ) 122 that points to the 
^ tag in the tag and data store 102 and a second pointer (P 2 ) 124 that points to 
20 the data entry. The other information is the same as in a flattened line such 
as: p-level 126; n-tag 128; parent 130; delete level 132; line type 134; and 
line control information 136. The second cell type 140 is for an insert. 
When an insert command is received a cell has to be moved. The moved cell 
is replaced with the insert cell 140. The insert cell has an insert flag 142 and 
25 a jump pointer 144. The moved cell and the inserted cell are at the jump 
pointer. 

FIG. 5 is a flow chart of a method of storing a structured data 
document. The process starts, step 150, by receiving the structured data 
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document at step 152. A first data entry is determined at step 154. In one 
embodiment, the first data entry is an empty data slot. At step 156 a first 
plurality of open tags and the first data entry is stored which ends the 
process at step 158. In one embodiment a level of a first opened tag is 
5 determined. The level of the first opened tag is stored. In another 

embodiment, a number of consecutive tags closed after the first data entry is 
determined. This number is then stored. A line number is stored. 

In one embodiment, a next data entry is determined. A next plurality of 
open tags proceeding the next data entry is stored. These steps are repeated 
10 until a next data entry is not found. Note that the first data entry may be a 
^ null. A plurality of format characters associated with the next data entry are 
2 also stored. In one embodiment the flattened data document is expanded 
Ml into the structured data document using the plurality of formatting 
O characters. 

15 '„ FIG. 6 is a flow chart of a method of storing a structured data 

document. The process starts, step 170, by flattening the structured data 
!J[ document to a provide a plurality of tags, a data entry and a plurality of 
|==f format characters in a single line at step 172. At step 174 the plurality of 
tags, the data entry and the plurality of format characters are stored which 

20 ends the process at step 176. In one embodiment, the plurality of tags are 

stored in a tag and data store. In addition, the plurality of format characters 
are stored in map store. The data entry is stored in the tag and data store. A 
first pointer in the map store points to the plurality of tags in the tag and 
data store. A second pointer is stored in the map store that points to the 

25 data store. In one embodiment, the structured data document is received. A 
first data entry is determined. A first plurality of open tags preceding the 
first data entry and the first data entry are placed in a first line. A next data 
entry is determined. A next plurality of open tags proceeding the next data 
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entry is placed in the next line. These steps are repeated until a next data 
entry is not found. In one embodiment a format character is placed in the 
first line. In one embodiment the format character is a number that indicates 
a level of a first tag that was opened. In one embodiment the format 

5 character is a number that indicates a number of tags that are consecutively 
closed after the first data entry. In one embodiment the format character is a 
number that indicates a line number of a parent of a lowest level tag. In one 
embodiment the format character is a number that indicates a level of a first 
tag that was opened but not closed. In one embodiment the format character 

10 is a character that indicates a line type. In one embodiment the format 
^ character indicates a line control information. In one embodiment the 

'sic?- 

;j structured data document is an extensible markup language document. In 
CO one embodiment the next data entry is placed in the next line. 
□ FIG. 7 is a flow chart of a method of storing a structured data 

15 , document. The process starts, step 180, by flattening the structured data 
J document to contain in a single line a tag, a data entry and a formatting 
\^ character at step 182. The formatting character is stored in a map store at 
W step 184. At step 186 the tag and the data entry are stored in a tag and data 
store which ends the process at step 188. In one embodiment a first pointer 
20 is stored in the map store that points to the tag in the tag and data store. A 
second pointer is stored in the map store that points to the data entry in the 
tag and data store. In one embodiment a cell is created in the map store for 
each of the plurality of lines in a flattened document. A request is received 
to delete one of the plurality of data entries. The cell associated with the one 
25 of the plurality of data entries is determined. A delete flag is set. Later a 

restore command is received. The delete flag is unset. In one embodiment, a 
request to delete one of a plurality of data entries and a plurality of related 
tags is received. A delete flag is set equal to the number of the plurality of 
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related tags plus one. In one embodiment, a request is received to insert a 
new entry. A previous cell containing a proceeding data entry is found. The 
new entry is stored at an end of the map store. A contents of the next cell is 
moved after the new entry. An insert flag and a pointer to the new entry is 
5 stored in the next cell. A second insert flag and second pointer is stored after 
the contents of the next cell. 

Thus there has been described a method of flattening a structured data 
document to form a numerical document object model (DOM). The process 
of flattening the structured data document generally reduces the number of 
10 lines used to describe the document. The flattened document is then stored 
P using a dictionary to reduce the memory required to store repeats of tags and 
Cj data. In addition, the dictionary (tag and data store) allows each cell in the 
r f j map store to be a fixed length. The result is a compressed document that 
S requires less memory to store and less bandwidth to transmit. 
15 ^ FIG. 8 is a block diagram of a system 200 for storing a flattened 

G structured data document (numerical DOM) in accordance with one 
ill embodiment of the invention. The system 200 has a map store 202, a 
o dictionary store 204 and a dictionary index 206. Note that this structure is 
similar to the system of FIG. 3. The dictionary store 204 has essentially the 
20 same function as the map and tag store (FIG. 3) 102. The difference is that a 
dictionary index 206 has been added. The dictionary index 206 is an 
associative index. An associative index transforms the item to be stored, 
such as a tag, tags or data entry, into an address. Note that in one 
embodiment the transform returns an address and a confirmer as explained 
25 in the United States patent application, Serial No. 09/419,217, entitled 
"Memory Management System and Method" filed on October 15, 1999, 
assigned to the same assignee as the present application and hereby 
incorporated by reference. The advantage of the dictionary index 206 is that 
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when a tag or data entry is received for storage it can be easily determined if 
the tag or data entry is already stored in the dictionary store 204. If the tag 
or data entry is already in the dictionary store the offset in the dictionary can 
be immediately determined and returned for use as a pointer in the map 
5 store 202. 

FIG. 9 is a block diagram of a system 220 for storing a flattened 
structured data document (numerical DOM) in accordance with one 
embodiment of the invention. A structured data document 222 is first 
processed by a flattener 224. The flattener 224 performs the functions 
10 described with respect to FIGs. 1 & 2 to form a numerical DOM. A parser 226 
□ then determines the data entries and the associated tags. One of the data 
%j entries is transformed by the transform generator 228. This is used to 
ijg determine if the data entry is in the associative index 230. When the data 
~ entry is not in the associative index 230, it is stored in the dictionary 232. A 
15 ^ pointer to the data in the dictionary is stored at the appropriate address in 
O the associative index 230. The pointer is also stored in a cell of the map 

PU store 234 as part of a flattened line. 

UJ 

q FIG. 10 is a flow chart of the steps used in a method of storing a 

flattened structured data document (numerical DOM) in accordance with one 

20 embodiment of the invention. The process starts, step 240, by flattening the 
structured data document to form a flattened structured data document 
(numerical DOM) at step 242. Each line of the flattened structured data 
document is parsed for a tag at step 244. Next it is determined if the tag is 
unique at step 246. When the tag is unique, step 248, the tag is stored in a 

25 dictionary store which ends the process at step 250. In one embodiment a 
tag dictionary offset is stored in the map store. A plurality of format 
characters are stored in the map store. When a tag is not unique, a tag 
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dictionary offset is determined. The tag dictionary offset is stored in the map 
store. 

In one embodiment, the tag is transformed to form a tag transform. An 
associative lookup is performed in a dictionary index using the tag transform. 
A map index is created that has a map pointer that points to a location in the 
map store of the tag. The map pointer is stored at an address of the map 
index that is associated with the tag transform. 

FIG. 11 is a flow chart of the steps used in a method of storing a 
flattened structured data document (numerical DOM) in accordance with one 
embodiment of the invention. The process starts, step 260, by receiving the 
flattened structured data document (numerical DOM) that has a plurality of 
lines (couplets) at step 262. Each of the plurality of lines contains a tag, a 
data entry and a format character. The tag is stored in a dictionary store at 
step 264. The data entry is stored in the dictionary store at step 266. At step 
268 the format character, a tag dictionary offset and a data dictionary offset 
are stored in a map store which ends the process at step 270. In one 
embodiment, the tag is transformed to form a tag transform. The tag 
dictionary offset is stored in a dictionary index at an address pointed to by 
the tag transform. In one embodiment, it is determined if the tag is unique. 
When the tag is unique, the tag is stored in the dictionary store otherwise the 
tag is not stored (again) in the dictionary store. To determine if the tag is 
unique, it is determined if a tag pointer is stored in the dictionary index at an 
address pointed to by the tag transform. 

In one embodiment, the data entry is transformed to form a data 
transform. The data dictionary offset is stored in the dictionary index at an 
address pointed to by the data transform. In one embodiment each of the 
flattened lines has a plurality of tags. 
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In one embodiment, a map index is created. Next it is determined if the 
tag is unique. When the tag is unique, a pointer to a map location of the tag 
is stored in the map index. When the tag is not unique, it is determined if a 
duplicates flag is set. When the duplicates flag is set, a duplicates count is 
incremented. When the duplicates flag is not set, the duplicates flag is set. 
The duplicates count is set to two. In one embodiment a transform of the tag 
with an instance count is calculated to form a first instance tag transform and 
a second instance tag transform. A first map pointer is stored in the map 
index at an address associated with the first instance transform. A second 
map pointer is stored in the map index at an address associated with the 
second instance transform. 

In one embodiment a transform of the tag with an instances count equal 
to the duplicates count is calculated to form a next instance tag transform. A 
next map pointer is stored in the map index at an address associated with the 
next instance transform. 

In one embodiment, a map index is created. Next it is determined if the 
data entry is unique. When the data entry is unique, a pointer to a map 
location of the tag is stored. 

Thus there has been described an efficient manner of storing a 
structured data document that requires significantly less memory than 
conventional techniques. The associative indexes significantly reduces the 
overhead required by the dictionary. 

FIG. 12 is a schematic diagram of a method of storing a numerical 
document object model in accordance with one embodiment of the 
invention. This is similar to the models described with respect to FIGs. 3 
& 8. The couplets (flattened lines) are stored in the map store 302. A tag 
dictionary 304 stores a copy of each unique tag string. For instance, the 
tag string CATALOG>CD>TITLE> 306 from line 44 (see FIG. 2) is stored in the 
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tag dictionary 304. Note that the tag ND> is associated with every line and 
therefor has been ignored for this discussion. A tag dictionary index 308 
is created. Every tag, incomplete tag string and complete tag string is 
indexed. As a result the tag CATALOG> 310, CATALOG>CD> 312 and every 
other permutation is stored in the tag index 308. Since a tag may occur in 
multiple entries it may have a number of pointers associated with the tag 
in the index. 

A data dictionary 314 stores a copy of each unique data entry such 
as "Bob Dylan". A data dictionary index 316 associates each data entry 
with its location in the dictionary. In one embodiment, the tag dictionary 
index and the data dictionary index are associative memories. Thus a 
mathematical transformation of the entry such as "Bob Dylan" provides 
the address in the index where a pointer to the entry is stored. In 
addition to the tag and data indices a map index 318 is created. The map 
index 318 contains an entry for every complete tag string (see string 306) 
and the complete tag string and associated data entry. Note that the map 
index may be an associative index. By creating these indices and 
dictionaries it is possible to quickly and efficiently search a structured 
data document. In addition, once the document is in this form it is 
possible to search for a data entry without ever having to look at the 
original document. 

FIG. 13 shows several examples of search queries of a numerical 
document object model in accordance with one embodiment of the 
invention. The first example 330 is a fully qualified query since a 
complete tag string has been specified. The second example 332 is also a 
fully qualified query since a complete tag string and a complete data entry 
have been specified. The third example is a not fully qualified query since 
a partially complete tag string has been specified. The fourth 336 and 
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fifth 338 examples are also examples of a not fully qualified query since 
the data entry is not complete. Note that the * stands for any wild card. 
If the data entry were completely specified, the query would be fully 
qualified. 

FIG. 14 is a flow chart of the steps used in a method of performing a 
search of a numerical document object model in accordance with one 
embodiment of the invention. The process starts, step 350, by receiving a 
query at step 352. When the query is a fully qualified query, the target is 
transformed to form a fully qualified hashing code at step 354. Note the 
phrase "fully qualified hashing code" means the hashing code for the 
target of a fully qualified query. In one embodiment the hashing code a 
mathematical transformation of the target to produce an address and a 
confirmer as explained in the United States patent application, Serial No. 
09/419,217, entitled "Memory Management System and Method" filed on 
October 15, 1999, assigned to the same assignee as the present 
application and hereby incorporated by reference. An associative lookup 
in a map index is performed using the fully qualified hashing code at step 
356. At step 358, a map offset is returned. At step 360, a data couplet is 
returned which ends the process at step 362. In one embodiment, an 
identified couplet of the numerical DOM (as stored in the map) is 
converted into an XML string. When the query is partially qualified, the 
target is transformed to form a partially qualified hashing code. An 
associative lookup is performed in a dictionary index using the partially 
qualified hashing code. A partially qualified query is one that does not 
contain a complete tag or data string, i.e, <TITLE> instead of 
ND>CATALOG>CD>TITLE>, of <Dylan> instead of <Bob Dylan>. A 
dictionary offset is returned. The complete string is located in the 
dictionary, using the dictionary offset. A pointer is located in a map index 
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using the complete string. The complete reference is located in the 
numerical DOM using the pointer. The data couplet is converted into a 
data XML string. 

In another embodiment, when the query includes a wildcard target, 
the dictionary is scanned for the wildcard target. A complete string is 
returned from the dictionary that contains the wildcard target. A pointer 
is located in a map index using the complete string. A couplet is located 
in the numerical DOM using the pointer. 

In one embodiment the hashing code is determined using linear 
feedback shift register operation, such as (but not limited to) a cyclical 
redundancy code. In another embodiment, the hashing code is 
determined by using a modulo two polynomial division. In one 
embodiment, the divisor polynomial is an irreducible polynomial. Other 
hashing codes may also be used. 

FIG. 15 is a flow chart of the steps used in a method of performing a 
search of a numerical document object model in accordance with one 
embodiment of the invention. The process starts, step 370, by receiving a 
query at step 372. A target type of the query is determined at step 374. 
When the target type is an incomplete data string, a sliding window search of 
a dictionary is performed at step 376. An incomplete data string could be 
<Bob> instead of <Bob Dylan>. A dictionary offset of a match is returned at 
step 378. In one embodiment a plurality of dictionary offsets are returned. 
At step 380 an incomplete data couplet is returned which ends the process at 
step 382. When the target type is an incomplete tag and a complete data 
string, the incomplete tag is transformed to form an incomplete target 
hashing code. An associative lookup in a map index is performed using the 
incomplete tag hashing code. At least one map offset is returned. The 
complete data string is transformed to form a complete data string hashing 
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code. An associative lookup is performed in the map index. A data string 
map offset is returned. Next, the at least one map offset is compared with 
the data string map offset. 

FIG. 16 is a flow chart of the steps used in a method of translating a 
structured data document in accordance with one embodiment of the 
invention. The process starts, step 390, by creating a numerical DOM of the 
structured data document at step 392. A first format dictionary is translated 
into a second format dictionary at step 394. At step 396 a second set of 
dictionary pointers are added to the dictionary index. The second set of 
dictionary pointers point to the offsets in the second format dictionary which 
ends the process at step 398. In one embodiment, a plurality of dictionary 
offset pointers are converted to a plurality of dictionary index pointers. This 
converts the map so it points to the dictionary index rather than the offsets 
into the dictionary, since there are two dictionaries now. 

FIG. 17 is a flow chart of the steps used in a method of creating an alias 
in a numerical document object s model in accordance with one embodiment 
of the invention. The process starts, step 410, by receiving an alias request at 
step 412. A dictionary offset for the original string in a dictionary is found at 
step 414. At step 416 the original string is converted to the alias at the 
dictionary offset which ends the process at step 418. An alias index is 
created that associates the alias and the original string or the dictionary 
offset of the original string, and in one embodiment the creation of the alias 
index includes creating an array that matches the dictionary offset to the 
original string. In another embodiment, the original string is transformed to 
form a string hashing code. An associative lookup in the dictionary is 
performed to find the dictionary offset. 

A method of performing a search of a numerical document object 
model begins when the system receives a query. The query is transformed to 
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form a fully qualified hashing code. An associative lookup is performed in a 
map index using the fully qualified hashing code. Finally, a map offset is 
returned. In one embodiment, an identified couplet of the numerical DOM is 
converted into an XML string. In another embodiment, it is determined if the 
5 target is a complete data string. When the target is a complete data string, 

the complete data string is transformed to form a complete hashing code. An 
associative lookup is performed in a dictionary index using the complete data 
hashing code. A dictionary offset is returned. The numerical DOM is 
scanned for the dictionary offset, and a data couplet is returned. In another 
10 embodiment the data couplet is converted into a data XML string. In another 
% embodiment, the system determines if the target is a wildcard data string. 
^ When the target is the wildcard data string, performing a sliding window 
}3 search of a dictionary. The system returns a dictionary offset of a match and 
Q scans the numerical DOM for the dictionary offset. An incomplete data 
15 5! couplet is returned. 

i;i Thus there has been described a method of performing searches of a 

jrj numerical DOM that is significantly more efficient than searching the 
; :J structured data document. 

The methods described herein can be implemented as computer- 
20 readable instructions stored on a computer-readable storage medium that 
when executed by a computer will perform the methods described herein. 

While the invention has been described in conjunction with specific 
embodiments thereof, it is evident that many alterations, modifications, and 
variations will be apparent to those skilled in the art in light of the foregoing 
25 description. Accordingly, it is intended to embrace all such alterations, 
modifications, and variations in the appended claims. 



